HIER-GRP:  A COMPUTER PROGRAM FOR THE
HIERARCHICAL  GROUPING  OF  REGRESSION  EQUATIONS 


I.  INTRODUCTION 

HIER-GRP,  an  acronym  for  hierarchical  grouping,  is  a computer  program  which  was  developed  for 
various  Air  Force  research  purposes  at  the  Computational  Sciences  Division,  Air  Force  Human  Resources 
Laboratory,  Brooks  AFB,  Texas.  Given  a starting  set  of  k regression  equations,  each  of  which  contains  the 
same  criterion  and  predictor  variables,  the  basic  objective  of  the  HIER-GRP  algorithm  is  to  group  or  to 
cluster  the  equations  in  a stepwise  or  iterative  manner  so  as  to  minimize  the  overall  loss  of  predictive 
efficiency  at  each  iteration.  Initially  there  are  k separate  groups;  i.e.,  each  of  the  k equations  is  considered 
as a group by itself, and a measure of overall predictive efficiency is computed. At the first iteration all
possible  ways  of  combining  any  two  of  the  equations  from  the  total  k equations  are  examined,  and  that 
combination  providing  the  minimum  loss  of  overall  predictive  efficiency  is  selected  to  form  a “new  group.” 
Formation  of  the  new  group  reduces  the  number  of  equations  to  k-1  for  the  start  of  the  second  iteration. 
The  process  continues  until  only  one  final  group  remains  and  is  “hierarchical”  in  the  sense  that  the  pattern 
of the number of groups from start to finish is k, k-1, k-2, ..., 1.

The  mathematical  theory  upon  which  HIER-GRP  is  based  is  documented  in  an  Air  Force  publication 
entitled  An  Iterative  Technique  for  Clustering  Criteria  Which  Retains  Optimum  Predictive  Efficiency  by 
Robert  A.  Bottenberg  and  Raymond  E.  Christal  (3).  Early  developmental  work  was  also  accomplished  by 
Joe  H.  Ward,  Jr.,  (16),  and  some  of  the  original  programming  was  done  by  Daniel  D.  Rigney. 

HIER-GRP  or  one  of  the  earlier  versions  of  the  program  has  been  used  extensively  by  the  Air  Force  in 
the past, especially in conjunction with “policy-capturing applications.” Policy-capturing is a methodology
composed  of  multiple  linear  regression  analysis  and  hierarchical  grouping  procedures  (1,  3, 4, 6,  7, 14, 16, 
17,  and  18).  In  this  context,  HIER-GRP  was  used  in  the  development  of  the  Weighted  Airman  Promotion 
System  (WAPS)  (10)  and  later  in  the  reevaluation  of  WAPS  (12  and  13).  The  program  was  also  used  in 
developing  officer  grade  requirements  (9),  a promotion  system  for  airman  basics  (2),  a screening  system  for 
the  Air  Reserve  Forces  (8),  and  a senior  NCO  promotion  system  (11). 

This  report  describes  the  technical  details  that  are  required  for  the  use  of  the  HIER-GRP  program  as 
it is currently operational on the Univac 1108 computer system at the Computational Sciences Division.
The  basic  algorithm  is  first  discussed,  and  the  essential  steps  are  outlined.  Details  of  the  computer  system 
requirements  and  descriptions  of  necessary  control  cards  are  then  presented.  Next,  the  output  of 
HIER-GRP  is  explained.  Appendices  are  included  that  contain  the  mathematical  formulas  used  in  the 
program,  some  mathematical  background  helpful  for  understanding  the  algorithm,  sample  output,  and  a 
complete  source  card  listing  of  the  program. 

Partly  as  a result  of  the  research  studies  referenced  above,  requests  for  copies  of  the  HIER-GRP 
computer  program  and  associated  documentation  from  different  Air  Force  agencies,  other  governmental 
organizations,  colleges,  and  universities  have  been  numerous.  Since  1969,  approximately  twenty  copies  of 
HIER-GRP  have  been  provided  to  different  requesters  and  implemented  on  a variety  of  different  computer 
systems.  One  purpose  of  this  report  is  to  provide  a document  which  can  be  used  to  satisfy  any  future 
requests  for  HIER-GRP. 


The  basic  steps  of  the  HIER-GRP  algorithm  can  be  summarized  as  the  following  five  phases:  (a)  data 
input  and  program  termination,  (b)  computation  of  the  overlap  matrix,  (c)  determination  of  the  order  of 
clustering,  (d)  computation  of  the  statistics  for  the  initial  k criteria,  and  (e)  iteration  to  reduce  the  number 
of  criteria.  Each  of  these  phases  is  described  in  the  following  steps.  The  steps  are  to  be  followed  in  numeric 
order  unless  indicated  otherwise. 

Steps  1-2.  Data  Input  and  Program  Termination 


1.  Read  “Problem  Definition  Card.”  This  card  defines  k,  the  number  of  criteria  or  regression 
equations  to  be  grouped  and  the  number  of  standardized  regression  (beta)  weights  in  each  equation.  If  no 
Problem  Definition  Card  is  read,  terminate  the  program. 

2.  Read  in  the  number  of  cases,  the  criterion  means  and  standard  deviations,  the  standardized 
regression  weights,  the  validities,  and  the  predictor  means  and  standard  deviations  for  each  equation.  Assign 
each  equation  the  identification  numbers  1 through  k,  respectively,  according  to  the  order  in  which  the 
equations  were  read. 


Step  3.  Computation  of  the  Overlap  Matrix 


3.  Compute the overlap matrix A, where each element $a_{ij}$ denotes the decrease in overall predictive
efficiency if equation i is combined with equation j, for i = 1, 2, ..., k; j = 1, 2, ..., k; and $i \ne j$. The
diagonal elements of A are undefined, and A is symmetric: each element above the diagonal equals the
corresponding element below the diagonal.

Steps  4-8.  Determination  of  the  Order  of  Clustering 

4.  Set  NGRPS,  the  index  denoting  the  current  number  of  groups,  equal  to  k.  Initially  each  criterion 
(equation)  belongs  to  a separate  cluster. 

5.  Considering all clusters present at the NGRPS stage, select two of the clusters, denoted by i and j,
such that:

a.  $a_{ij} \le a_{\ell m}$, where $\ell$ and m are the identification numbers of any two clusters present at the NGRPS
stage and $\ell \ne m$, and

b.  i < j. This can be accomplished by examining the elements above the diagonal of the overlap
matrix and selecting the smallest element.

6.  Form a new criterion cluster from the old clusters i and j identified in Step 5. Record the
identifications of the two clusters i and j in the storage areas IU_NGRPS and [illegible]_NGRPS, respectively. Assign
the new cluster the identification number i.

7.  Decrement NGRPS by 1. If NGRPS > 1, go to Step 8; otherwise proceed to Step 9.

8.  Update the overlap matrix as follows. For each $\ell$, $\ell \ne i$ of Step 6, where $\ell$ is the identification
number of a criterion cluster present at the NGRPS stage, compute the decrease in overall predictive
efficiency if equation $\ell$ is combined with equation i. Since NGRPS was reduced by 1 in Step 7, the
dimension of the updated overlap matrix will be reduced by 1. Return to Step 5.
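
Steps 3 through 8 can be sketched in Python as follows. This is an illustrative outline only, not the FORTRAN source: the function name `clustering_order`, the dictionary representation of the overlap matrix, and the caller-supplied `loss` function are all assumptions made for the example, and ties between equal losses are broken by the smaller pair of identification numbers, a choice the report does not specify.

```python
from itertools import combinations


def clustering_order(k, loss):
    """Return the merges chosen by Steps 4-8 as a list of (i, j) pairs, i < j.

    `loss(a, b)` is a hypothetical caller-supplied function returning the
    decrease in overall predictive efficiency when clusters `a` and `b`
    (frozensets of equation identification numbers) are combined.
    """
    # Step 4: each equation starts as its own cluster.
    clusters = {i: frozenset([i]) for i in range(1, k + 1)}
    # Step 3: overlap matrix, stored only above the diagonal (i < j).
    overlap = {(i, j): loss(clusters[i], clusters[j])
               for i, j in combinations(sorted(clusters), 2)}
    merges = []
    while len(clusters) > 1:
        # Step 5: smallest element above the diagonal.
        i, j = min(overlap, key=lambda pair: (overlap[pair], pair))
        # Step 6: record the pair; the new cluster keeps the number i.
        merges.append((i, j))
        clusters[i] = clusters[i] | clusters.pop(j)
        # Steps 7-8: drop row and column j, then recompute entries involving i.
        overlap = {p: v for p, v in overlap.items() if j not in p}
        for l in clusters:
            if l != i:
                overlap[(min(l, i), max(l, i))] = loss(clusters[l], clusters[i])
    return merges
```

Only the row and column of the newly formed cluster are recomputed at each pass, which mirrors the update described in Step 8.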

Step 9.  Computation of the Statistics for the Initial k Criteria

9.  Compute the squared multiple correlation coefficient for each of the initial k regression equations
and, also, ORU_k, the overall squared multiple correlation coefficient obtained by considering a regression
model with no grouping of the initial equations.

Steps 10-15.  Iteration to Reduce the Number of Criteria


10.  Form  an  initial  grouping  of  the  k equations  by  assigning  each  equation  to  a group  by  itself.  This 
is  the  “k  groups”  stage.  Set  NGRPS  equal  to  k. 

11.  Form a new grouping of the k equations by following the grouping order established in Steps
4-8. This is accomplished by combining the groups identified by IU_NGRPS and [illegible]_NGRPS and assigning
the new group (criterion cluster) the identification number in IU_NGRPS.

12.  Compute  the  least  squares  regression  equation  which  can  be  used  to  predict  the  new  group  and 
decrement  NGRPS  by  1. 

13.  Print all statistics concerning the new grouping including:

a.  the  identification  numbers  of  the  two  equations  combined  at  this  iteration, 

b.  an F value testing the difference between the prediction equations for the two clusters in
(a),

c.  an F value testing the difference between the k initial prediction equations and the smaller
set of NGRPS equations (one for each cluster) used at the “NGRPS groups” stage, and

d.  the overall squared multiple correlation coefficient obtained using the NGRPS equations at
this stage.

14.  Print  a summary  of  all  groups  (clusters)  present  at  the  NGRPS  stage.  Also,  print  the  prediction 
equation  for  the  new  group  (including  standardized  and  raw  score  weights). 

15.  If NGRPS > 1, loop back to Step 11; otherwise, return to Step 1 and begin the next problem.
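
The bookkeeping in Steps 10, 11, 12, and 15 amounts to replaying the recorded merge order. The Python sketch below illustrates only that replay; the name `replay_groupings` and the representation of the merge record as a list of (i, j) pairs are assumptions for the example, and the regression and printing work of Steps 12 through 14 is omitted.

```python
def replay_groupings(k, merges):
    """Yield (ngrps, groups) after each recorded merge.

    `merges` is the ordered list of (i, j) pairs recorded while the order
    of clustering was determined; `groups` maps each cluster
    identification number to the sorted list of its member equations.
    """
    groups = {i: [i] for i in range(1, k + 1)}  # Step 10: the "k groups" stage
    ngrps = k
    for i, j in merges:
        # Step 11: combine the two recorded groups; the new group keeps number i.
        groups[i] = sorted(groups[i] + groups.pop(j))
        # Step 12: one fewer group remains.
        ngrps -= 1
        yield ngrps, {cid: list(members) for cid, members in groups.items()}
```

For k = 4 and the merge record [(1, 2), (3, 4), (1, 3)], the generator yields the rosters for the 3-group, 2-group, and 1-group stages in turn.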


III.  DESCRIPTIONS  OF  THE  HIER-GRP  PROGRAM 
Systems  Requirements 


The HIER-GRP program is composed of seven routines: the main or driver routine and six
subroutines.  The  entire  program,  with  the  exception  of  the  Univac  Assembly  Language  subroutine  START, 
is  written  in  FORTRAN  IV.  The  assembly  subroutine  START  is  called  once  at  the  beginning  of  the  driver 
routine  and  is  never  called  again.  Its  only  function  is  to  reset  the  margin  control  on  the  Univac  1108 
printer. 

The  Univac  version  of  FORTRAN  has  a special  statement,  the  Parameter  statement,  which  is  used  in 
the  driver  routine  and  which  may  not  be  available  on  other  computers.  The  Parameter  statement  is  used  to 
define  the  dimensions  of  arrays  at  compilation  time.  The  Parameter  statement  can  be  removed  if  each  array 
is  dimensioned  to  its  required  size. 

The  complete  HIER-GRP  program  requires  approximately  10,000  36-bit  words  of  core  storage  in 
addition  to  the  number  of  words  required  for  arrays.  If  P is  the  number  of  predictors  and  E is  the  number 
of equations, then the amount of storage required for arrays is 12E + 3P + [2*E*P] + [E*(E-1)/2] + 14. For
example, if P = 50 and E = 50, then 6,989 words of storage are required for arrays.
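
The array storage formula can be checked with a few lines of Python. The function name `array_words` is chosen for this illustration only and does not appear in the program.

```python
def array_words(p, e):
    """Words of array storage for p predictors and e equations."""
    # e * (e - 1) is always even, so the integer division is exact.
    return 12 * e + 3 * p + 2 * e * p + e * (e - 1) // 2 + 14


print(array_words(50, 50))  # 6989, the figure quoted in the text
```

The term 2*E*P dominates for problems of any size, so storage grows roughly in proportion to the product of the number of equations and the number of predictors.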

There  are  a total  of  1,121  cards  in  the  HIER-GRP  program  deck.  Of  these,  only  601  are  source 
language  cards  and  the  remainder  are  comments  cards.  The  number  of  cards  and  the  intrinsic  system 
routines required in each of the seven routines which make up HIER-GRP are listed in Table 1.

Table 1.  Characteristics of the HIER-GRP Routines

Program          Source        Number of Source   Number of       Intrinsic System
Name             Language      Language Cards     Comment Cards   Routines Required

DRIVER (MAIN)    FORTRAN IV          100               311        None
START            ASSEMBLY              7                 0        None
OVRLP            FORTRAN IV           36                36        None
GROUP            FORTRAN IV           76                48        None
STAGE            FORTRAN IV           81                42        None
PRINTG           FORTRAN IV          218                82        SQRT
PLEVEL           FORTRAN IV           83                 1        ATAN, SQRT, ALOG, EXP, SIN

Data  Requirements 


A HIER-GRP  user  must  supply  the  following  data  for  each  regression  equation: 

1.  The number of cases (N) which were used to compute the equation

2.  The  criterion  mean  and  standard  deviation  (SD) 

3.  The  standardized  regression  weights 

4.  The  validity  coefficients  (correlations  of  predictor  or  independent  variables  with  the  criterion  or 
dependent  variable) 

5.  The  predictor  means  and  standard  deviations. 

The  computational  formulas  developed  by  Bottenberg  and  Christal  (3)  and  used  within  the  program 
assume  that  the  predictor  sums-of-squares  and  cross-products  matrices  are  proportional;  i.e.,  that  the  ratios 
of  the  corresponding  elements  of  the  sums-of-squares  and  cross-products  matrices  for  any  two  equations  to 
be  clustered  are  equal  to  the  ratio  of  the  corresponding  numbers  of  the  cases  within  each  equation.  This 
assumption of proportionality is discussed in detail by Bottenberg and Christal (1961, see pages 8 through
11) and is also addressed in Appendix B (see equation 9b) of this report. In practice this assumption is met by
selecting  items  (1)  and  (5)  of  the  previous  paragraph  to  be  identical  for  each  equation. 

Run-Stream  Organization 


The  following  card  sequence  is  required  to  use  the  HIER-GRP  program  as  it  is  operational  on  a 
Univac 1108 computer:

Order   Card Type

 1.     @RUN
 2.     @XQT T*T.HIER-GRP
 3.     Problem Definition Card
 4.     Header Card(s)
 5.     Format Card for Equation Ns
 6.     Data Card(s) - Equation Ns
 7.     Format Card for Criterion Means and SDs
 8.     Data Card(s) - Criterion Means and SDs
 9.     Format Card for Beta Weights
10.     Data Card(s) - Beta Weights
11.     Format Card for Validities
12.     Data Card(s) - Validities
13.     Format Card for Predictor Means and SDs
14.     Data Card(s) - Predictor Means and SDs
15.     The sequence of cards 3-14 is required for each run. As many problems as
        desired may be run by stacking one problem after another.
16.     Blank Card to Terminate Run
17.     @FIN

The Univac 1108 System Cards (1, 2, and 17) are described in the Univac Exec 8 Reference Manual
(15). Descriptions of cards 3-16 are presented in the next section. See Appendix C for sample run-stream
and  sample  control  cards. 


Control  Cards 


Problem Definition Card

Card        FORTRAN
Columns     Format      Description

1-3         I3          NEQS, the number of criteria (systems, regression
                        equations) in this problem. NEQS must be less than
                        or equal to 50.

4-6         I3          NPREDS, the number of beta weights (standardized
                        regression weights) in each equation. NPREDS must
                        be less than or equal to 100.

7           I1          IOPT, the grouping (clustering) option desired.
                        Normally a “6” is specified, which causes the
                        grouping to be done based on the iterative technique
                        developed by Bottenberg and Christal (3). Other
                        options are included in the program and comments
                        cards, but are for future developmental purposes only.

8           I1          NHDRS, the number of header (label, title) cards
                        that follow this control card. Header cards can be
                        omitted (NHDRS = 0) or up to 9 cards may be specified.

9           I1          IREAD, the data read option. IREAD = 0 means read
                        the beta weights and validities NPREDS items at a
                        time. IREAD = 1 means read them NEQS*NPREDS items at
                        a time. IREAD allows flexibility in the format of
                        input data. However, IREAD is normally set equal to
                        zero.

10-80                   These card columns are not read.
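
The fixed-column layout of the Problem Definition Card can be illustrated with a short Python reader. This is a sketch for readers preparing input on other systems, not a transcription of the driver routine: the names `ProblemCard` and `parse_problem_card` are hypothetical, blanks inside a numeric field are assumed to count as zeros, and a card whose first nine columns are blank is assumed to be the blank card that terminates the run.

```python
from typing import NamedTuple, Optional


class ProblemCard(NamedTuple):
    neqs: int    # columns 1-3, format I3
    npreds: int  # columns 4-6, format I3
    iopt: int    # column 7, format I1
    nhdrs: int   # column 8, format I1
    iread: int   # column 9, format I1


def _field(card: str, first: int, last: int) -> int:
    """Read 1-based card columns first..last as an integer field."""
    # Assumption: blanks within a numeric field are read as zeros.
    return int(card[first - 1:last].replace(" ", "0"))


def parse_problem_card(card: str) -> Optional[ProblemCard]:
    """Return the card's parameters, or None for a terminating blank card."""
    card = card.rstrip("\n").ljust(80)
    if not card[:9].strip():
        return None  # assumption: a blank card ends the run
    pc = ProblemCard(_field(card, 1, 3), _field(card, 4, 6),
                     _field(card, 7, 7), _field(card, 8, 8),
                     _field(card, 9, 9))
    if pc.neqs > 50:
        raise ValueError("NEQS must be less than or equal to 50")
    if pc.npreds > 100:
        raise ValueError("NPREDS must be less than or equal to 100")
    return pc
```

For example, a card beginning with the nine characters ` 10 25600` describes a problem with 10 equations, 25 beta weights per equation, grouping option 6, no header cards, and IREAD = 0.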

Header  Cards 

Each  header  card  will  be  printed  only  once  at  the  beginning  of  the  grouping  report.  Exactly  NHDRS 
header  cards  must  be  present. 


Format  and  Data  Cards 

1.  Format  Card  for  Equation  Ns.  This  card  supplies  the  FORTRAN  variable  format  by  which  the 
number  of  cases  used  in  the  computation  of  each  equation  is  to  be  read.  Only  the  F and  X editing  codes  are 
permitted. 

2.  Data  Card(s)  - Equation  Ns.  These  cards  are  read  according  to  the  previous  format  card.  The 
number  of  cards  required  depends  on  the  format  specifications. 

3.  Format  Card  for  Criterion  Means  and  SDs.  This  card  provides  the  FORTRAN  variable  format  by 
which  the  criterion  mean  and  standard  deviation  for  each  equation  are  to  be  read.  Only  the  F and  X editing 
codes  are  permitted. 

4.  Data Card(s) - Criterion Means and SDs. These cards are read according to the previous format
card.  The  number  of  cards  required  depends  on  the  format  specifications. 

5.  Format  Card  for  Beta  Weights.  This  card  supplies  the  FORTRAN  variable  format  by  which  the 
beta  weights  (NPREDS  weights  per  equation)  are  to  be  read.  Only  the  F and  X editing  codes  are  permitted. 

6.  Data Card(s) - Beta Weights. These cards are read according to the previous format card. Exactly
NEQS sets of cards are required if IREAD = 0. The first set contains the beta weights for equation 1; the
second  set  contains  the  beta  weights  for  equation  2;  and  so  on.  The  number  of  cards  within  each  set 
depends  on  the  format  specifications. 

7.  Format  Card  for  Validities.  This  card  provides  the  FORTRAN  variable  format  by  which  the 
validity  coefficients  for  each  equation  are  read.  Only  the  F and  X editing  codes  are  permitted. 

8.  Data Card(s) - Validities. These cards are read according to the previous format card. Exactly
NEQS  sets  of  cards  are  required  if  IREAD  = 0.  The  first  set  contains  the  validities  for  equation  1;  the 
second  set  contains  the  validities  for  equation  2;  and  so  on.  The  number  of  cards  within  each  set  depends 
on  the  format  specifications. 

9.  Format  Card  for  Predictor  Means  and  SDs.  This  card  supplies  the  FORTRAN  variable  format  by 
which  the  predictor  means  and  standard  deviations  for  each  equation  are  to  be  read.  Only  the  F and  X 
editing  codes  are  permitted. 

10.  Data Card(s) - Predictor Means and SDs. These cards are read according to the previous format
card.  The  number  of  cards  required  depends  on  the  format  specifications. 

Output 


The  printed  output  of  HIER-GRP  is  divided  into  five  parts  - the  monogram  and  version  date,  the 
control  card  parameters,  the  problem  header  label,  the  format  and  input  data  cards,  and  the  criterion 
grouping  results.  Each  of  these  divisions  is  described  in  the  following  paragraphs.  Refer  to  Appendix  C for 
sample  output. 

Monogram  and  Version  Date 

The  program  title  “Hierarchical  Grouping  Program  HIER-GRP,”  the  AFHRL  monogram,  and  the 
program  version  date  are  printed  at  the  beginning  of  each  problem.  The  program  version  date  is  the  last 
time  the  program  was  updated  or  modified. 

Control  Card  Parameters 

The  parameters  specified  on  the  Problem  Definition  card  are  printed  under  the  heading  CONTROL 
CARD  PARAMETERS.  This  includes  the  number  of  regression  equations  (criteria),  the  number  of  beta 
weights  in  each  equation,  the  grouping  option  desired,  and  the  number  of  header  cards  for  this  problem. 

Problem Header Label

The problem header label, if header cards were specified on the Problem Definition Card, is printed
under the heading PROBLEM HEADER LABEL.

Format  and  Input  Data  Cards 

All  format  cards  and  all  input  data  are  printed  under  the  heading  FORMAT  CARDS  AND  INPUT 
DATA.  First,  the  format  statements  used  to  read  the  number  of  cases  and  the  criterion  means  and  standard 
deviations  for  each  equation  are  printed.  A table  listing  the  equation  numbers,  the  number  of  cases,  the 
criterion  means,  and  the  criterion  standard  deviations  is  printed  next.  Third,  the  format  statement  used  to 
read the beta weights and a table listing the equation number and the beta weights (15 per line) for each
equation  are  printed.  Fourth,  the  format  statement  used  to  read  the  validity  coefficients,  and  a table  listing 
the  equation  number  and  the  validities  (15  per  line)  for  each  equation  are  printed.  Finally,  the  format 
statement  used  to  read  the  predictor  means  and  standard  deviations  and  a table  listing  the  predictor  variable 
number  and  predictor  means  and  standard  deviations  (one  each  per  line)  are  printed. 

Criterion  Grouping  Results 

The  results  of  the  clustering  process  are  printed  under  the  heading  HIERARCHICAL  GROUPING 
RESULTS.  The  output  in  this  division  can  be  separated  into  three  parts  - the  grouping  option  description, 
the  R-square  (RSQ)  summary  for  the  NEQS  initial  criteria,  and  the  results  of  each  iteration.  Each  of  these 
sections  is  described  as  follows. 

1.  Grouping  Option  Description.  The  grouping  option  and  a verbal  description  of  the  grouping 
option  specified  on  the  Problem  Definition  Card  are  printed. 

2.  RSQ  Summary  for  the  NEQS  Initial  Criteria.  The  number,  NEQS,  of  initial  criteria;  the  overall 
RSQ, ORU_NEQS, achieved by using the beta weights specified on the input data cards; and a table listing
the  equation  number  and  the  RSQ  for  each  equation  are  printed. 

3.  Results of Each Iteration. The statistics and tables printed at each iteration, i.e., the information
printed below each row of asterisks, are listed in Table 2.

Table 2.  Output for Each Iteration

Computer Output Label
    Meaning

STAGE = s
    s is the number of criterion clusters present at the end of this iteration.

OVERALL RSQ = ORU_s
    This is the RSQ obtained by using s equations (one for each criterion cluster present at
    this stage) to predict the NEQS initial criteria.

SYSTEMS GROUPING THIS STAGE Table

  SYS IDENT
    The identification (ID) numbers of the two criterion clusters combined at this iteration.

  NO. MEMBERS
    The number of members in each criterion cluster. The ID numbers of the members of each
    cluster can be obtained by referring to the summary roster for stage s+1.

  NO. CASES
    The number of cases used in the computation of the prediction equation for each criterion
    cluster. This number is the sum of the number of cases used in the prediction equation for
    each member of the cluster.

  RSQ
    The squared multiple correlation coefficient which is obtained by predicting each criterion
    within a cluster from the same compromise regression equation.

  DECISION VALUE
    The loss associated with replacement of the two clusters combined at this stage.

F-TEST FOR THE EQUALITY OF REGRESSION PARAMETERS FOR SYS'S COMBINED AT THIS STAGE Table
    This table outlines a test of the hypothesis that the prediction equations for the two
    criterion clusters combined at this stage are identical. Equivalently, it is a test of the
    loss in predictive efficiency when each criterion within the two clusters combined at this
    stage is predicted from the same compromise equation.

  CHANGE FROM s+1 SYSTEMS: RSQ = ORU_{s+1} - ORU_s
    The decrease in OVERALL RSQ from stage s+1.

  CHANGE FROM s+1 SYSTEMS: DF = NPREDS+1
    The decrease in the number of parameters estimated from stage s+1.

  RESIDUAL: RSQ = 1 - ORU_{s+1}
    The proportion of the criterion variance attributable to error at stage s+1.

  RESIDUAL: DF = N - (s+1)(NPREDS+1)
    The total number of cases less the number of parameters estimated at stage s+1.
    Equivalently, DF is the number of degrees of freedom associated with the error portion of
    the criterion variance at stage s+1.

  FSTAT = [(ORU_{s+1} - ORU_s)/(NPREDS+1)] / [(1 - ORU_{s+1})/(N - (s+1)(NPREDS+1))]
    The F statistic testing the hypothesis described in the preceding paragraph (FOR SYS'S
    COMBINED AT THIS STAGE).

  SIG LVL
    The probability that a value of the F statistic greater than FSTAT would occur by chance.
    A value of SIG LVL equal to α means that if the hypothesis being tested is true, then a
    value of the F statistic greater than FSTAT would have occurred 100α percent of the time by
    chance. Therefore, small values of α tend to reject the hypothesis being tested.

F-TEST FOR THE EQUALITY OF REGRESSION PARAMETERS FOR SYS'S COMBINED UP TO THIS STAGE Table
    This table outlines a test of the hypothesis that the prediction equations for all members
    of criterion cluster number 1 are identical, the prediction equations for all members of
    criterion cluster 2 are identical, and so on for the s criterion clusters present at the
    end of this iteration. Equivalently, this tests the loss in predictive efficiency when s
    equations (one for each criterion cluster) are used to predict the NEQS initial criteria
    instead of the original NEQS equations.

  CHANGE FROM NEQS SYSTEMS: RSQ = ORU_NEQS - ORU_s
    The decrease in OVERALL RSQ from stage NEQS.

  CHANGE FROM NEQS SYSTEMS: DF = (NEQS-s)(NPREDS+1)
    The decrease in the number of parameters estimated from stage NEQS.

  RESIDUAL: RSQ = 1 - ORU_NEQS
    The proportion of the criterion variance attributable to error at stage NEQS.

  RESIDUAL: DF = N - (NEQS)(NPREDS+1)
    The total number of cases less the number of parameters estimated at stage NEQS.
    Equivalently, DF is the number of degrees of freedom associated with the error portion of
    the criterion variance at stage NEQS.

  FSTAT = [(ORU_NEQS - ORU_s)/((NEQS-s)(NPREDS+1))] / [(1 - ORU_NEQS)/(N - (NEQS)(NPREDS+1))]
    The F statistic testing the hypothesis described in the preceding paragraph (FOR SYS'S
    COMBINED UP TO THIS STAGE).

  SIG LVL
    The probability that a value of the F statistic greater than FSTAT would occur by chance.
    A value of SIG LVL equal to α means that if the hypothesis being tested is true, then a
    value of the F statistic greater than FSTAT would have occurred 100α percent of the time by
    chance. Therefore, small values of α tend to reject the hypothesis being tested.

SYSTEMS SUMMARY ROSTER Table
    The summary roster is a listing of all the criterion clusters present at the end of the
    current iteration. The members and the RSQ for each cluster are also printed. In addition,
    the prediction equation and the system mean and standard deviation for the new criterion
    cluster formed at the present iteration are printed. The compromise equation for each
    criterion cluster present at a given iteration can be obtained by referring to the summary
    roster for the stage at which the cluster was formed.

  STAGE IDENT
    The stage at which each criterion cluster was formed.

  SYS LOSS
    The contribution of each criterion cluster to the decrease in OVERALL RSQ from stage NEQS.
    Equivalently, this is the amount by which the OVERALL RSQ would increase if each of the
    criteria within this cluster were predicted from their individual regression equations
    rather than from the compromise equation for the cluster.

  NO. MEMBERS
    The number of criteria within each criterion cluster. The ID numbers of the members of each
    cluster are listed under the headings SYS IDENT and IDENTIFICATION OF OTHER MEMBERS in this
    table.

  RSQ
    The squared multiple correlation coefficient which is obtained by predicting each criterion
    within a cluster from the same compromise regression equation.

  NO. CASES
    The number of cases used in the computation of the compromise equation for a criterion
    cluster. This number is the sum of the number of cases used to compute the regression
    equation for each criterion within the cluster.

  SYS IDENT
    The ID number of a criterion cluster. This is also the smallest ID number of the criteria
    within this cluster.

  IDENTIFICATION OF OTHER MEMBERS
    The ID numbers of the remaining criteria within a cluster.

  NEW SYS CRITERION MEAN
    The criterion mean for the cluster formed at this iteration.

  NEW SYS CRITERION SD
    The criterion standard deviation for the cluster formed at this iteration.

  BETA WEIGHTS FOR THE NEW SYSTEM S
    The values (10 per line) of the least squares standardized regression coefficients for the
    regression equation which is the best single predictor for all the criteria in the new
    cluster, where S is the ID number of the new cluster. Equivalently, these are the beta
    weights which would be obtained by pooling the observations for all the criteria in the new
    cluster and computing the regression of the pooled criteria on the NPREDS predictor
    variables.

  RAW SCORE WEIGHTS FOR THE NEW SYSTEM S
    The values (5 per line) of the raw score weights for the regression equation which is the
    best single predictor for all the criteria in the new cluster S.

  REGRESSION CONSTANT
    The regression constant for the regression equation which is the best single predictor of
    all the criteria in the new cluster.

  [Label not legible in source]
    A list of the identification numbers of the “Y” single criteria which have not been
    combined with any system up to this stage.
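
The two F statistics defined in Table 2 can be written out in Python as a check on hand calculations. This is a sketch of the printed formulas only, with s denoting the number of clusters at the end of the iteration; the function names are hypothetical, and the significance level (SIG LVL), which the program obtains from its PLEVEL routine, is not reproduced here.

```python
def f_this_stage(oru_prev, oru_s, s, n, npreds):
    """F for the two systems combined at this stage.

    oru_prev is the OVERALL RSQ at stage s+1, oru_s the OVERALL RSQ at
    stage s, n the total number of cases, npreds the number of predictors.
    """
    df_change = npreds + 1
    df_resid = n - (s + 1) * (npreds + 1)
    return ((oru_prev - oru_s) / df_change) / ((1.0 - oru_prev) / df_resid)


def f_up_to_stage(oru_neqs, oru_s, s, n, neqs, npreds):
    """F for all systems combined up to this stage (stage NEQS versus stage s)."""
    df_change = (neqs - s) * (npreds + 1)
    df_resid = n - neqs * (npreds + 1)
    return ((oru_neqs - oru_s) / df_change) / ((1.0 - oru_neqs) / df_resid)
```

At the first iteration (s = NEQS - 1) the two statistics coincide, since both compare the same pair of stages with the same degrees of freedom.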
APPENDIX A: NOTATION AND COMPUTATIONAL FORMULAS

$'$, The transpose of the associated matrix.

$k$, The initial number of criteria.

$p$, The number of variables.

$n_i$, The number of cases used in the computation of the regression equation for criterion i.

$m_i$, The mean for criterion i.

$\sigma_i^2$, The variance for criterion i.

$\hat{\alpha}_i$, The constant term in the regression equation for criterion i.

$\hat{b}_i$, The $p \times 1$ vector of regression weights for criterion i.

$\hat{\beta}_i$, The $p \times 1$ vector of standard regression weights for criterion i.

$c_i$, The $p \times 1$ vector of validities (intercorrelations between the criterion and the p independent variables) for criterion i.

$N$, The total number of cases.

$$N = n_1 + n_2 + \cdots + n_k$$

$m_0$, The pooled criterion mean.

$$N m_0 = n_1 m_1 + n_2 m_2 + \cdots + n_k m_k$$

$\sigma_0^2$, The pooled criterion variance.

$$N\sigma_0^2 = n_1(\sigma_1^2 + m_1^2) + \cdots + n_k(\sigma_k^2 + m_k^2) - N m_0^2$$

$g_I$, The number of criteria in cluster I.

$I$, The set of criteria in cluster I, $I = \{i_1, i_2, \ldots, i_{g_I}\}$. In the succeeding definitions, let I be the union of clusters J and L, $I = J \cup L$.

$N_I$, The number of cases used in the computation of the composite equation for cluster I.

$$N_I = \sum_{i \in I} n_i = N_J + N_L$$

$M_I$, The criterion mean for cluster I.

$$N_I M_I = \sum_{i \in I} n_i m_i = N_J M_J + N_L M_L$$

$\sigma_I^2$, The criterion variance for cluster I.

$$N_I\sigma_I^2 = \sum_{i \in I} n_i(\sigma_i^2 + m_i^2) - N_I M_I^2 = N_J(\sigma_J^2 + M_J^2) + N_L(\sigma_L^2 + M_L^2) - N_I M_I^2$$

$\hat{\alpha}_I$, The constant term in the regression equation for cluster I.

$$N_I\hat{\alpha}_I = \sum_{i \in I} n_i\hat{\alpha}_i = N_J\hat{\alpha}_J + N_L\hat{\alpha}_L$$

$\hat{b}_I$, The $p \times 1$ vector of regression weights for cluster I.

$$N_I\hat{b}_I = \sum_{i \in I} n_i\hat{b}_i = N_J\hat{b}_J + N_L\hat{b}_L$$
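$\hat{\beta}_I$, The $p \times 1$ vector of standard regression weights for cluster I.

$$N_I\sigma_I\hat{\beta}_I = \sum_{i \in I} n_i\sigma_i\hat{\beta}_i = N_J\sigma_J\hat{\beta}_J + N_L\sigma_L\hat{\beta}_L$$

$c_I$, The $p \times 1$ vector of validities for cluster I.

$$N_I\sigma_I c_I = \sum_{i \in I} n_i\sigma_i c_i = N_J\sigma_J c_J + N_L\sigma_L c_L$$

$R_i^2$, The squared multiple correlation coefficient for the regression on criterion i.

$$R_i^2 = \hat{\beta}_i' c_i$$

$R_I^2$, The squared multiple correlation coefficient for the regression on cluster I.

$$R_I^2 = \hat{\beta}_I' c_I = \frac{1}{N_I^2\sigma_I^2}\left[N_J^2\sigma_J^2 R_J^2 + N_L^2\sigma_L^2 R_L^2 + N_J N_L\sigma_J\sigma_L\left(\hat{\beta}_J' c_L + \hat{\beta}_L' c_J\right)\right]$$

$G_s$, The set of s criterion clusters present at the s cluster stage.

${}_sR^2$, The squared multiple correlation coefficient for the criterion grouping, $G_s$, at the s cluster stage.

$$N\sigma_0^2\;{}_sR^2 = \sum_{I \in G_s} N_I\left(\sigma_I^2 R_I^2 + M_I^2\right) - N m_0^2$$

Let $G_s = \{J, L, K_3, \ldots, K_s\}$ and $G_{s-1} = \{J \cup L, K_3, \ldots, K_s\}$; then

$${}_sR^2 - {}_{s-1}R^2 = \frac{N_J N_L}{N\sigma_0^2 (N_J + N_L)}\left[\left(\sigma_J\hat{\beta}_J - \sigma_L\hat{\beta}_L\right)'\left(\sigma_J c_J - \sigma_L c_L\right) + (M_J - M_L)^2\right]$$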
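The merging formulas above lend themselves to a short numerical sketch. The following Python fragment is only an illustration and is not taken from the HIER-GRP source listing; the class name `Cluster`, the functions `merge` and `r2_loss`, and all numeric values are hypothetical. It combines the summary statistics of two clusters J and L and evaluates the drop in the overall squared multiple correlation directly from the definition of ${}_sR^2$.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cluster:
    """Summary statistics for one criterion cluster (Appendix A notation)."""
    n: float            # N_I, number of cases
    mean: float         # M_I, criterion mean
    var: float          # sigma_I^2, criterion variance
    alpha: float        # constant term of the regression equation
    b: List[float]      # raw regression weights (length p)
    beta: List[float]   # standard regression weights (length p)
    c: List[float]      # validities (length p)

    @property
    def r2(self) -> float:
        # R_I^2 = beta_I' c_I
        return sum(x * y for x, y in zip(self.beta, self.c))


def merge(J: Cluster, L: Cluster) -> Cluster:
    """Statistics of I = J union L, computed from those of J and L."""
    n = J.n + L.n
    mean = (J.n * J.mean + L.n * L.mean) / n
    var = (J.n * (J.var + J.mean ** 2) + L.n * (L.var + L.mean ** 2)) / n - mean ** 2
    sd_i, sd_j, sd_l = var ** 0.5, J.var ** 0.5, L.var ** 0.5

    def by_cases(x, y):
        # weights proportional to the number of cases
        return (J.n * x + L.n * y) / n

    def by_cases_and_sd(x, y):
        # weights proportional to cases times criterion standard deviation
        return (J.n * sd_j * x + L.n * sd_l * y) / (n * sd_i)

    return Cluster(
        n, mean, var,
        by_cases(J.alpha, L.alpha),
        [by_cases(x, y) for x, y in zip(J.b, L.b)],
        [by_cases_and_sd(x, y) for x, y in zip(J.beta, L.beta)],
        [by_cases_and_sd(x, y) for x, y in zip(J.c, L.c)],
    )


def r2_loss(J: Cluster, L: Cluster, n_total: float, var_total: float) -> float:
    """Drop in the overall R^2 caused by merging J and L."""
    merged = merge(J, L)
    before = J.n * (J.var * J.r2 + J.mean ** 2) + L.n * (L.var * L.r2 + L.mean ** 2)
    after = merged.n * (merged.var * merged.r2 + merged.mean ** 2)
    return (before - after) / (n_total * var_total)


# Hypothetical two-cluster problem with p = 1.
J = Cluster(10, 2.0, 4.0, 1.0, [0.5], [0.5], [0.6])
L = Cluster(20, 3.0, 9.0, 2.0, [0.8], [0.4], [0.5])
loss = r2_loss(J, L, n_total=30, var_total=merge(J, L).var)
```

At each iteration a grouping procedure of this kind would evaluate `r2_loss` for every pair of current clusters and merge the pair with the smallest value.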
APPENDIX B: MATHEMATICAL BACKGROUND


Mathematical Model for the Clustering Algorithm

Suppose that a set of p independent variables, $v' = (v_1, \ldots, v_p)$, is linearly related to the expected values of each of k criteria, $Y_1, \ldots, Y_k$; that is,

$$E(Y_i \mid v) = v'b_i + \alpha_i \quad \text{for } i = 1, \ldots, k \tag{1}$$

where $b_i$ is a $p \times 1$ vector of unknown population parameters and $\alpha_i$ is an unknown population constant. Let $y_i$ be an $n_i \times 1$ vector of independent observations on criterion $Y_i$; let $X_i$ be an $n_i \times p$ matrix of observations on the set of p independent variables v, where the j-th element of $y_i$ corresponds to the j-th row of $X_i$; and let $u_i$ be an $n_i \times 1$ vector in which each element is 1. Then from (1),

$$E(y_i \mid X_i) = X_i b_i + u_i\alpha_i \quad \text{for } i = 1, \ldots, k. \tag{1a}$$

Let $N = n_1 + \cdots + n_k$; let $Y' = [y_1', \ldots, y_k']$, the $1 \times N$ vector obtained by pooling all the criterion observations; let

$$X = \begin{bmatrix} u_1\ X_1 & 0 & \cdots & 0 \\ 0 & u_2\ X_2 & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & u_k\ X_k \end{bmatrix},$$

the $N \times k(p+1)$ block diagonal matrix obtained by placing the $n_i \times (p+1)$ matrix of observations $[u_i\ X_i]$ in the i-th block diagonal position; and let $b' = [\alpha_1\ b_1'\ \cdots\ \alpha_k\ b_k']$, the $k(p+1)$ vector of unknown parameters. Under the assumption that the criterion observations are independent and have common variance, the mathematical model for the clustering algorithm is

$$E(Y \mid X) = Xb \quad \text{with} \quad D(Y \mid X) = \sigma^2 I, \tag{1b}$$

where $D(Y \mid X)$ is the dispersion matrix of the criterion observations, $\sigma^2$ is the common variance, and I is the $N \times N$ identity matrix.

Minimum  Variance  Unbiased  Estimation  and  Hypothesis  Testing 


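The $k(p+1) \times 1$ vector b of unknown parameters in (1b) corresponds to the k equations in (1a). The minimum variance unbiased estimates (mvue), $\hat{\alpha}_i$ and $\hat{b}_i$, of $\alpha_i$ and $b_i$ are obtained from (1b) by the method of least squares, where

$$\begin{aligned} \hat{b}_i &= \left[X_i'X_i - \tfrac{1}{n_i}X_i'u_iu_i'X_i\right]^{-1}\left[X_i'y_i - \tfrac{1}{n_i}X_i'u_iu_i'y_i\right] \\ \hat{\alpha}_i &= \tfrac{1}{n_i}u_i'y_i - \tfrac{1}{n_i}u_i'X_i\hat{b}_i \end{aligned} \tag{2}$$

for $i = 1, \ldots, k$.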
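As a concrete instance of (2), the sketch below evaluates the two formulas for a single predictor (p = 1), where the matrix inverse reduces to a scalar division. The function name `fit_simple` and the data are illustrative assumptions, not part of HIER-GRP.

```python
def fit_simple(x, y):
    """Least-squares estimates (alpha_hat, b_hat) from equation (2) with p = 1."""
    n = len(x)
    sx, sy = sum(x), sum(y)                    # u'X and u'y
    sxx = sum(v * v for v in x)                # X'X
    sxy = sum(u * v for u, v in zip(x, y))     # X'y
    b_hat = (sxy - sx * sy / n) / (sxx - sx * sx / n)
    alpha_hat = sy / n - b_hat * sx / n
    return alpha_hat, b_hat


# Hypothetical data lying exactly on y = 1 + 2x.
alpha_hat, b_hat = fit_simple([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
```

For p greater than 1 the same expressions hold with the bracketed $p \times p$ matrix inverted by any standard linear solver.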


These are the estimates that would be obtained by the method of least squares from the k separate models

$$E(y_i \mid X_i) = X_i b_i + u_i\alpha_i \quad \text{with} \quad D(y_i \mid X_i) = \sigma^2 I \quad \text{for } i = 1, \ldots, k \tag{3}$$

where the error variance, $\sigma^2$, is the same for each model. It might be that some or all of the equations in (1) are identical. The technique of homogeneity of regression can be used to test the equality of vectors of regression parameters across several criteria. Chipman and Rao (1964) and Theil (1970) have developed methods for obtaining mvue under general linear restrictions and for testing general linear hypotheses. Rao (1965, pp. 189-190) shows that in the case

$$E(Y \mid X) = Xb \quad \text{with} \quad D(Y \mid X) = \sigma^2 I, \tag{4}$$

where X is $n \times s$ of rank s and b is $s \times 1$, the mvue, $\hat{b}_\psi$, for b under the linear restriction

$$\Psi b = 0 \tag{4a}$$

is

$$\hat{b}_\psi = B(B'X'XB)^{-1}B'X'Y \tag{4b}$$
where $\Psi$ is $r \times s$ of rank r, B is $s \times (s-r)$ of rank $(s-r)$, and $\Psi B = 0$. Rao obtains this result by introducing the general solution, $B\theta$, of (4a), where $\theta$ is an $(s-r) \times 1$ vector of new parameters, into (4) to obtain the model

$$E(Y \mid X) = XB\theta \quad \text{with} \quad D(Y \mid X) = \sigma^2 I \tag{5}$$

and no restrictions on $\theta$. The mvue of $B\theta$ is $B\hat{\theta}$ (see Rao, 1965, pp. 181-182), where $\hat{\theta}$ is the mvue of $\theta$ in (5). If, in addition to (4), Y has a multivariate normal distribution, then Chipman and Rao develop an expression for an unbiased critical region of size $\delta$ for the following hypothesis:

$$\Psi_1 b = 0 \quad \text{given that} \quad \Psi_0 b = 0 \tag{6}$$

where $\Psi_1$ is $r_1 \times s$ of rank $r_1$, $\Psi_0$ is $r_0 \times s$ of rank $r_0$, and $\Psi' = [\Psi_0'\ \Psi_1']$ is $s \times (r_0 + r_1)$ of rank $(r_0 + r_1)$. The expression for the unbiased critical region of size $\delta$ is

$$F = \left(\frac{n - s + r_0}{r_1}\right)\frac{EXSS}{ESSH} = \left(\frac{n - s + r_0}{r_1}\right)\frac{R_{\Psi_0}^2 - R_{\Psi}^2}{1 - R_{\Psi_0}^2} > F_\delta(r_1,\, n - s + r_0) \tag{7}$$

where $F_\delta(r_1, n - s + r_0)$ is the $100(1-\delta)\%$ point of the central F distribution with $r_1$ and $n - s + r_0$ degrees of freedom, and

$$ESSH = (Y - X\hat{b}_{\psi_0})'(Y - X\hat{b}_{\psi_0}),$$

$$EXSS = (Y - X\hat{b}_{\psi})'(Y - X\hat{b}_{\psi}) - ESSH,$$

$\hat{b}_{\psi_0}$ is the mvue of b under the restriction $\Psi_0 b = 0$,

$\hat{b}_{\psi}$ is the mvue of b under the restriction $\Psi b = 0$,

$R_{\Psi_0}^2$ is the squared multiple correlation under the restriction $\Psi_0 b = 0$, and

$R_{\Psi}^2$ is the squared multiple correlation under the restriction $\Psi b = 0$.

The Chipman and Rao computational form for F is different from the form in (7), but the two are equivalent. (See Rao, 1965, pp. 199-200.)
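The restricted estimator (4b) can be illustrated with the smallest possible case. The sketch below assumes a model with two weights and no constant term (s = 2), the single restriction $b_1 - b_2 = 0$, so that $\Psi = [1\ {-1}]$ and $B = [1\ 1]'$; the function name and data are hypothetical. Because B has one column, $B'X'XB$ is a scalar and no matrix inversion is needed.

```python
def restricted_equal_weights(X, y):
    """Evaluate (4b) for s = 2 under the restriction b1 = b2, with B = [1, 1]'."""
    z = [row[0] + row[1] for row in X]             # the single column of XB
    bxy = sum(zi * yi for zi, yi in zip(z, y))     # B'X'Y
    bxxb = sum(zi * zi for zi in z)                # B'X'XB (a scalar here)
    theta_hat = bxy / bxxb                         # mvue of theta in model (5)
    return [theta_hat, theta_hat]                  # B * theta_hat


# Hypothetical observations.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [1.0, 3.0, 4.0]
b_restricted = restricted_equal_weights(X, y)
```

The returned vector satisfies $\Psi\hat{b}_\psi = 0$ by construction, which is the point of substituting $B\theta$ for b.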


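MVUE for a Criterion Cluster


The restriction $\alpha_1 = \alpha_2 = \cdots = \alpha_t$ and $b_1 = b_2 = \cdots = b_t$ can be expressed in the form $\Psi b = 0$ as

$$\Psi = \begin{bmatrix} I & -I & 0 & \cdots & 0 & 0 & \cdots & 0 \\ I & 0 & -I & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & & & \ddots & & \vdots & & \vdots \\ I & 0 & 0 & \cdots & -I & 0 & \cdots & 0 \end{bmatrix} \tag{8}$$

where I is the $(p+1) \times (p+1)$ identity matrix. $\Psi$ has $(t-1)(p+1)$ rows; its first $t(p+1)$ columns contain the identity blocks and its last $(k-t)(p+1)$ columns are zero. To express model (1b) in a form similar to equation (5) under the above restriction (8), the $k(p+1) \times (k-t+1)(p+1)$ matrix B, where

$$B' = \begin{bmatrix} I\ \ I\ \cdots\ I & 0 \\ 0 & I_{(k-t)(p+1)} \end{bmatrix}$$

has $(k-t+1)(p+1)$ rows and column blocks of widths $t(p+1)$ and $(k-t)(p+1)$, and the $(k-t+1)(p+1)$ vector of new parameters $\theta$, where

$$\theta' = [\alpha_\psi\ \ b_\psi'\ \ \alpha_{t+1}\ \ b_{t+1}'\ \cdots\ \alpha_k\ \ b_k'],$$

is introduced into (1b) to yield the model

$$E(Y \mid X) = XB\theta = \begin{bmatrix} \begin{matrix} u_1 & X_1 \\ \vdots & \vdots \\ u_t & X_t \end{matrix} & & & 0 \\ & u_{t+1}\ X_{t+1} & & \\ & & \ddots & \\ 0 & & & u_k\ X_k \end{bmatrix} \begin{bmatrix} \alpha_\psi \\ b_\psi \\ \alpha_{t+1} \\ b_{t+1} \\ \vdots \\ \alpha_k \\ b_k \end{bmatrix} \tag{9}$$

with $D(Y \mid X) = \sigma^2 I$.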


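The effect of B is to pool the observations for criteria $1, \ldots, t$. The mvue $\hat{\alpha}_\psi$ and $\hat{b}_\psi$ for the criterion cluster $(1, 2, \ldots, t)$ formed from criteria $1, \ldots, t$ can be calculated in either of two ways: pool the observations as in (9) and compute $\hat{\alpha}_\psi$ and $\hat{b}_\psi$ from the normal equations

$$\left(\begin{bmatrix} n_1 & u_1'X_1 \\ X_1'u_1 & X_1'X_1 \end{bmatrix} + \cdots + \begin{bmatrix} n_t & u_t'X_t \\ X_t'u_t & X_t'X_t \end{bmatrix}\right)\begin{bmatrix} \hat{\alpha}_\psi \\ \hat{b}_\psi \end{bmatrix} = \begin{bmatrix} u_1'y_1 \\ X_1'y_1 \end{bmatrix} + \cdots + \begin{bmatrix} u_t'y_t \\ X_t'y_t \end{bmatrix} \tag{9a}$$

or, if the predictor sums-of-squares and cross-product matrices are proportional, i.e.,

$$\frac{1}{n_1}\begin{bmatrix} n_1 & u_1'X_1 \\ X_1'u_1 & X_1'X_1 \end{bmatrix} = \frac{1}{n_2}\begin{bmatrix} n_2 & u_2'X_2 \\ X_2'u_2 & X_2'X_2 \end{bmatrix} = \cdots = \frac{1}{n_t}\begin{bmatrix} n_t & u_t'X_t \\ X_t'u_t & X_t'X_t \end{bmatrix} \tag{9b}$$

then $\hat{\alpha}_\psi$ and $\hat{b}_\psi$ can be calculated from $\hat{\alpha}_1, \hat{b}_1, \ldots, \hat{\alpha}_t$, and $\hat{b}_t$ given in (2) without forming the sum of matrices on the left hand side in (9a). Using (9b) this sum of matrices is

$$\begin{bmatrix} n_1 & u_1'X_1 \\ X_1'u_1 & X_1'X_1 \end{bmatrix} + \cdots + \begin{bmatrix} n_t & u_t'X_t \\ X_t'u_t & X_t'X_t \end{bmatrix} = \frac{N_t}{n_i}\begin{bmatrix} n_i & u_i'X_i \\ X_i'u_i & X_i'X_i \end{bmatrix} \quad \text{for } i = 1, \ldots, t \tag{9c}$$

where $N_t = n_1 + n_2 + \cdots + n_t$. Using (9c) the solution of (9a) is

$$\begin{bmatrix} \hat{\alpha}_\psi \\ \hat{b}_\psi \end{bmatrix} = \sum_{i=1}^{t} \frac{n_i}{N_t}\begin{bmatrix} n_i & u_i'X_i \\ X_i'u_i & X_i'X_i \end{bmatrix}^{-1}\begin{bmatrix} u_i'y_i \\ X_i'y_i \end{bmatrix}.$$

Thus, the mvue for a criterion cluster are

$$\begin{bmatrix} \hat{\alpha}_\psi \\ \hat{b}_\psi \end{bmatrix} = \frac{n_1}{N_t}\begin{bmatrix} \hat{\alpha}_1 \\ \hat{b}_1 \end{bmatrix} + \cdots + \frac{n_t}{N_t}\begin{bmatrix} \hat{\alpha}_t \\ \hat{b}_t \end{bmatrix}. \tag{10}$$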
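The equivalence of the two routes, pooling the observations as in (9a) or averaging the separate estimates as in (10), can be checked numerically. The sketch below uses p = 1 and two hypothetical groups whose predictor values have the same mean and the same mean square, so that (9b) holds; the function name `fit_simple` and the data are illustrative assumptions.

```python
def fit_simple(x, y):
    """Least-squares (alpha_hat, b_hat) for one predictor, as in equation (2)."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    sxx = sum(v * v for v in x)
    sxy = sum(u * v for u, v in zip(x, y))
    b_hat = (sxy - sx * sy / n) / (sxx - sx * sx / n)
    return sy / n - b_hat * sx / n, b_hat


# Two hypothetical criteria; the second group repeats the x pattern of the
# first, so the matrices in (9b) are proportional.
x1, y1 = [0.0, 1.0, 2.0], [1.0, 2.0, 4.0]
x2, y2 = [0.0, 1.0, 2.0, 0.0, 1.0, 2.0], [0.0, 3.0, 3.0, 1.0, 2.0, 5.0]

a1, b1 = fit_simple(x1, y1)
a2, b2 = fit_simple(x2, y2)
n1, n2 = len(x1), len(x2)

# Route 1: equation (10), a case-weighted average of the separate estimates.
a_avg = (n1 * a1 + n2 * a2) / (n1 + n2)
b_avg = (n1 * b1 + n2 * b2) / (n1 + n2)

# Route 2: pool the observations and solve the normal equations (9a).
a_pool, b_pool = fit_simple(x1 + x2, y1 + y2)
```

With these data both routes give a constant of 2/3 and a weight of 5/3. If the predictor moments differ between groups, (9b) fails and the two routes generally disagree.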

When (9b) holds, the formula for the standardized regression weights for a criterion cluster is easy to obtain. Let $\hat{\beta}_\psi, \hat{\beta}_1, \ldots, \hat{\beta}_t$ be the standardized weights corresponding to the raw weights $\hat{b}_\psi, \hat{b}_1, \ldots, \hat{b}_t$; let $Q_i$ be the $p \times p$ diagonal matrix with its elements equal to the standard deviations calculated from the observation matrix $X_i$ for the p independent variables; let $Q_\psi$ be the $p \times p$ diagonal matrix with its elements equal to the standard deviations calculated from the pooled observation matrix $[X_1'\ X_2'\ \cdots\ X_t']'$ for the p independent variables; and let $\sigma_\psi^2, \sigma_1^2, \ldots, \sigma_t^2$ be the sample variances for the vectors of criterion observations $[y_1'\ y_2'\ \cdots\ y_t']', y_1, \ldots, y_t$, respectively. By definition the standardized weights are

$$\hat{\beta}_\psi = \frac{Q_\psi\hat{b}_\psi}{\sigma_\psi}, \quad \hat{\beta}_1 = \frac{Q_1\hat{b}_1}{\sigma_1}, \quad \ldots, \quad \hat{\beta}_t = \frac{Q_t\hat{b}_t}{\sigma_t}.$$

From (9b), $Q_\psi = Q_1 = \cdots = Q_t$; therefore using (10), the formula for the standardized weights for a criterion cluster is

$$\hat{\beta}_\psi = \frac{1}{N_t\sigma_\psi}\left(n_1\sigma_1\hat{\beta}_1 + \cdots + n_t\sigma_t\hat{\beta}_t\right). \tag{10a}$$


Multiple  Correlation  Coefficient  for  a Criterion  Cluster 

Let $R_\psi^2, R_1^2, \ldots, R_t^2$ be the squared multiple correlation coefficients for the criterion cluster formed from criteria $1, \ldots, t$ and for the t criteria $y_1, \ldots, y_t$, respectively; let $c_i$ be the $p \times 1$ vector of intercorrelations calculated from the observations $X_i$ and $y_i$ between the p independent variables and the i-th criterion; and let $c_\psi$ be the $p \times 1$ vector of intercorrelations calculated from the pooled observations $[X_1'\ X_2'\ \cdots\ X_t']'$ and $[y_1'\ y_2'\ \cdots\ y_t']'$ between the p independent variables and the criterion cluster $(1, 2, \ldots, t)$. By definition,

$$n_i\sigma_i Q_i c_i = X_i'y_i - \frac{1}{n_i}X_i'u_iu_i'y_i \quad \text{for } i = 1, \ldots, k$$

and

$$N_t\sigma_\psi Q_\psi c_\psi = (X_1'y_1 + \cdots + X_t'y_t) - \frac{1}{N_t}\left[X_1'u_1 + \cdots + X_t'u_t\right]\left[u_1'y_1 + \cdots + u_t'y_t\right].$$

From (9c), $\frac{1}{N_t}\left[X_1'u_1 + \cdots + X_t'u_t\right] = \frac{1}{n_i}X_i'u_i$ for $i = 1, \ldots, t$. Therefore,

$$N_t\sigma_\psi Q_\psi c_\psi = n_1\sigma_1 Q_1 c_1 + \cdots + n_t\sigma_t Q_t c_t.$$

But $Q_\psi = Q_1 = \cdots = Q_t$, so the validity coefficients for a criterion cluster are

$$c_\psi = \frac{1}{N_t\sigma_\psi}\left(n_1\sigma_1 c_1 + \cdots + n_t\sigma_t c_t\right). \tag{10b}$$

The squared multiple correlation coefficient for the cluster $(1, 2, \ldots, t)$ is

$$R_\psi^2 = \hat{\beta}_\psi' c_\psi = \frac{1}{N_t^2\sigma_\psi^2}\left(n_1\sigma_1\hat{\beta}_1 + \cdots + n_t\sigma_t\hat{\beta}_t\right)'\left(n_1\sigma_1 c_1 + \cdots + n_t\sigma_t c_t\right). \tag{10c}$$


Hypothesis  Testing 


The critical region given in (7) for the hypothesis (6) requires the calculation of the error sum of squares or the squared multiple correlation coefficient for model (1b) when restrictions are imposed on the unknown parameters. The error sum of squares, ESS, for model (1b) when there are no restrictions on the unknown parameters is equal to the sum of the error sums of squares, $ESS_i$, for the k models (see (3)), i.e.,

$$ESS = ESS_1 + ESS_2 + \cdots + ESS_k.$$

Let $m_0$ and $\sigma_0^2$ be the criterion mean and variance calculated from the pooled criterion observation vector Y, and let $m_1, \ldots, m_k$ be the criterion means for $y_1, \ldots, y_k$, respectively. Then

$$ESS_i = n_i\sigma_i^2(1 - R_i^2) \quad \text{for } i = 1, \ldots, k$$

$$N m_0 = n_1 m_1 + n_2 m_2 + \cdots + n_k m_k$$

$$N\sigma_0^2 = n_1(\sigma_1^2 + m_1^2) + \cdots + n_k(\sigma_k^2 + m_k^2) - N m_0^2.$$

Therefore the squared multiple correlation, $R^2$, for (1b) is

$$R^2 = \frac{N\sigma_0^2 - ESS}{N\sigma_0^2} = \frac{n_1(\sigma_1^2 R_1^2 + m_1^2) + \cdots + n_k(\sigma_k^2 R_k^2 + m_k^2) - N m_0^2}{n_1(\sigma_1^2 + m_1^2) + \cdots + n_k(\sigma_k^2 + m_k^2) - N m_0^2}. \tag{11}$$

The error sum of squares, ESSH, for (9) is

$$ESSH = ESS_\psi + ESS_{t+1} + \cdots + ESS_k$$

where $ESS_\psi = N_t\sigma_\psi^2(1 - R_\psi^2)$. Therefore the squared multiple correlation, $R_\Psi^2$, for (9) is

$$R_\Psi^2 = \frac{N\sigma_0^2 - ESSH}{N\sigma_0^2}.$$

The hypothesis (8) can be tested at the $\alpha$ significance level by computing

$$F = \left(\frac{N - k(p+1)}{(t-1)(p+1)}\right)\left(\frac{R^2 - R_\Psi^2}{1 - R^2}\right) \tag{11a}$$

and rejecting (8) if F exceeds the $100(1-\alpha)\%$ point of the central F distribution with $(t-1)(p+1)$ and $N - k(p+1)$ degrees of freedom.
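Equations (11) and (11a) involve only summary statistics, so they can be sketched in a few lines. The function names `overall_r2` and `f_statistic` and every numeric value below are hypothetical; the critical value itself would come from an F table or a statistical library and is not computed here.

```python
def overall_r2(n, mean, var, r2):
    """Equation (11): overall R^2 from per-group n_i, m_i, sigma_i^2 and R_i^2."""
    N = sum(n)
    m0 = sum(ni * mi for ni, mi in zip(n, mean)) / N
    explained = sum(ni * (vi * ri + mi * mi)
                    for ni, mi, vi, ri in zip(n, mean, var, r2)) - N * m0 * m0
    total = sum(ni * (vi + mi * mi)
                for ni, mi, vi in zip(n, mean, var)) - N * m0 * m0
    return explained / total


def f_statistic(N, k, p, t, r2_full, r2_restricted):
    """Equation (11a): F ratio and its degrees of freedom for hypothesis (8)."""
    df1 = (t - 1) * (p + 1)
    df2 = N - k * (p + 1)
    F = (df2 / df1) * (r2_full - r2_restricted) / (1.0 - r2_full)
    return F, df1, df2


# Hypothetical two-criterion summary statistics.
r2_example = overall_r2(n=[10, 20], mean=[2.0, 3.0], var=[4.0, 9.0], r2=[0.3, 0.2])

# Hypothetical test: k = 4 criteria, p = 2 predictors, t = 2 criteria pooled.
F, df1, df2 = f_statistic(N=100, k=4, p=2, t=2, r2_full=0.50, r2_restricted=0.45)
```

The same `overall_r2` function gives $R_\Psi^2$ for model (9) if the pooled cluster is passed as a single group with its own $N_t$, mean, variance, and $R_\psi^2$.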


Application to a Four Criteria Model: A Worked Example

Given four criteria $y_1, y_2, y_3$, and $y_4$, where $y_i$ is an $n_i \times 1$ vector of observations, and the predictor matrices $X_1, X_2, X_3$, and $X_4$, where $X_i$ is an $n_i \times p$ matrix of observations on p independent variables, the greatest predictive power is attained when each criterion variable is predicted from its regression on the independent variables. The initial stage, i.e., Stage 4, employs the following model:

$$E\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \begin{bmatrix} u_1\ X_1 & 0 & 0 & 0 \\ 0 & u_2\ X_2 & 0 & 0 \\ 0 & 0 & u_3\ X_3 & 0 \\ 0 & 0 & 0 & u_4\ X_4 \end{bmatrix}\begin{bmatrix} \alpha_1 \\ b_1 \\ \alpha_2 \\ b_2 \\ \alpha_3 \\ b_3 \\ \alpha_4 \\ b_4 \end{bmatrix} = \begin{bmatrix} \alpha_1 u_1 + X_1 b_1 \\ \alpha_2 u_2 + X_2 b_2 \\ \alpha_3 u_3 + X_3 b_3 \\ \alpha_4 u_4 + X_4 b_4 \end{bmatrix} \tag{12}$$

In block form the three factors are $4 \times 4$, $4 \times 1$, and $4 \times 1$.

The mvue $\hat{\alpha}_i$ and $\hat{b}_i$ for $\alpha_i$ and $b_i$ are obtained from (2), and the squared multiple correlation coefficient, ${}_4R^2$, for model (12) is obtained from (11).

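For Stage 3, assume (9b) holds for $X_1, X_2, X_3$, and $X_4$. Under the linear hypothesis $\alpha_1 = \alpha_2$ and $b_1 = b_2$, the mvue $\hat{\alpha}_{12}$ and $\hat{b}_{12}$ for the criterion cluster (1,2) formed from criteria 1 and 2 are (see (10))

$$\begin{bmatrix} \hat{\alpha}_{12} \\ \hat{b}_{12} \end{bmatrix} = \frac{n_1}{n_1 + n_2}\begin{bmatrix} \hat{\alpha}_1 \\ \hat{b}_1 \end{bmatrix} + \frac{n_2}{n_1 + n_2}\begin{bmatrix} \hat{\alpha}_2 \\ \hat{b}_2 \end{bmatrix}.$$

The standard weights, $\hat{\beta}_{12}$, and the validities, $c_{12}$, for the cluster (1,2) are (see (10a) and (10b))

$$\hat{\beta}_{12} = \frac{1}{(n_1 + n_2)\sigma_{12}}\left(n_1\sigma_1\hat{\beta}_1 + n_2\sigma_2\hat{\beta}_2\right), \text{ and}$$

$$c_{12} = \frac{1}{(n_1 + n_2)\sigma_{12}}\left(n_1\sigma_1 c_1 + n_2\sigma_2 c_2\right), \text{ where}$$

$$(n_1 + n_2)\sigma_{12}^2 = n_1(\sigma_1^2 + m_1^2) + n_2(\sigma_2^2 + m_2^2) - \frac{(n_1 m_1 + n_2 m_2)^2}{n_1 + n_2}.$$

The model used to obtain these estimates is (see (9))

$$E\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \begin{bmatrix} \begin{matrix} u_1 & X_1 \\ u_2 & X_2 \end{matrix} & 0 & 0 \\ 0 & u_3\ X_3 & 0 \\ 0 & 0 & u_4\ X_4 \end{bmatrix}\begin{bmatrix} \alpha_{12} \\ b_{12} \\ \alpha_3 \\ b_3 \\ \alpha_4 \\ b_4 \end{bmatrix} = \begin{bmatrix} \alpha_{12}\begin{bmatrix} u_1 \\ u_2 \end{bmatrix} + \begin{bmatrix} X_1 \\ X_2 \end{bmatrix} b_{12} \\ \alpha_3 u_3 + X_3 b_3 \\ \alpha_4 u_4 + X_4 b_4 \end{bmatrix} \quad \text{with} \quad D\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \sigma^2 I. \tag{13}$$

The squared multiple correlation coefficient, ${}_3R^2$, for (13) is (from (11) with k=3)

$${}_3R^2 = \frac{(n_1 + n_2)(\sigma_{12}^2 R_{12}^2 + m_{12}^2) + n_3(\sigma_3^2 R_3^2 + m_3^2) + n_4(\sigma_4^2 R_4^2 + m_4^2) - N m_0^2}{(n_1 + n_2)(\sigma_{12}^2 + m_{12}^2) + n_3(\sigma_3^2 + m_3^2) + n_4(\sigma_4^2 + m_4^2) - N m_0^2},$$

where

$$R_{12}^2 = \hat{\beta}_{12}' c_{12}, \quad m_{12} = \frac{n_1 m_1 + n_2 m_2}{n_1 + n_2}, \quad N = n_1 + n_2 + n_3 + n_4, \text{ and}$$

$$N m_0 = n_1 m_1 + n_2 m_2 + n_3 m_3 + n_4 m_4.$$

Equation (11a) can now be used to test at the $\alpha$ significance level the hypothesis H1: $\alpha_1 = \alpha_2$ and $b_1 = b_2$ by computing

$$F = \left(\frac{N - 4(p+1)}{p+1}\right)\left(\frac{{}_4R^2 - {}_3R^2}{1 - {}_4R^2}\right)$$

and rejecting H1 if F exceeds $F_\alpha(p+1,\, N - 4(p+1))$.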

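For Stage 2, accepting H1 as true, the additional restrictions $\alpha_3 = \alpha_4$ and $b_3 = b_4$ are imposed and the mvue, $\hat{\alpha}_{34}$ and $\hat{b}_{34}$, for the criterion cluster (3,4) formed from criteria 3 and 4 are

$$\begin{bmatrix} \hat{\alpha}_{34} \\ \hat{b}_{34} \end{bmatrix} = \frac{n_3}{n_3 + n_4}\begin{bmatrix} \hat{\alpha}_3 \\ \hat{b}_3 \end{bmatrix} + \frac{n_4}{n_3 + n_4}\begin{bmatrix} \hat{\alpha}_4 \\ \hat{b}_4 \end{bmatrix}.$$

The standard weights, $\hat{\beta}_{34}$, and the validities, $c_{34}$, for the cluster (3,4) are

$$\hat{\beta}_{34} = \frac{1}{(n_3 + n_4)\sigma_{34}}\left(n_3\sigma_3\hat{\beta}_3 + n_4\sigma_4\hat{\beta}_4\right), \text{ and}$$

$$c_{34} = \frac{1}{(n_3 + n_4)\sigma_{34}}\left(n_3\sigma_3 c_3 + n_4\sigma_4 c_4\right), \text{ where}$$

$$(n_3 + n_4)\sigma_{34}^2 = n_3(\sigma_3^2 + m_3^2) + n_4(\sigma_4^2 + m_4^2) - \frac{(n_3 m_3 + n_4 m_4)^2}{n_3 + n_4}.$$

The model used to obtain these estimates is

$$E\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \begin{bmatrix} \begin{matrix} u_1 & X_1 \\ u_2 & X_2 \end{matrix} & 0 \\ 0 & \begin{matrix} u_3 & X_3 \\ u_4 & X_4 \end{matrix} \end{bmatrix}\begin{bmatrix} \alpha_{12} \\ b_{12} \\ \alpha_{34} \\ b_{34} \end{bmatrix} = \begin{bmatrix} \alpha_{12}\begin{bmatrix} u_1 \\ u_2 \end{bmatrix} + \begin{bmatrix} X_1 \\ X_2 \end{bmatrix} b_{12} \\ \alpha_{34}\begin{bmatrix} u_3 \\ u_4 \end{bmatrix} + \begin{bmatrix} X_3 \\ X_4 \end{bmatrix} b_{34} \end{bmatrix} \quad \text{with} \quad D\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \sigma^2 I. \tag{14}$$

The squared multiple correlation coefficient, ${}_2R^2$, for (14) is (from (11) with k=2)

$${}_2R^2 = \frac{(n_1 + n_2)(\sigma_{12}^2 R_{12}^2 + m_{12}^2) + (n_3 + n_4)(\sigma_{34}^2 R_{34}^2 + m_{34}^2) - N m_0^2}{(n_1 + n_2)(\sigma_{12}^2 + m_{12}^2) + (n_3 + n_4)(\sigma_{34}^2 + m_{34}^2) - N m_0^2}$$

where $R_{34}^2 = \hat{\beta}_{34}' c_{34}$ and $(n_3 + n_4)m_{34} = n_3 m_3 + n_4 m_4$. Equation (11a) can now be used to test at the $\alpha$ significance level the hypothesis

H2: $\alpha_3 = \alpha_4$ and $b_3 = b_4$ given H1 is true

by computing

$$F = \left(\frac{N - 3(p+1)}{p+1}\right)\left(\frac{{}_3R^2 - {}_2R^2}{1 - {}_3R^2}\right)$$

and rejecting H2 if F exceeds $F_\alpha(p+1,\, N - 3(p+1))$.

Equation (11a) can also be used to test the hypothesis

H3: $\alpha_1 = \alpha_2$, $b_1 = b_2$, $\alpha_3 = \alpha_4$, and $b_3 = b_4$

by computing

$$F = \left(\frac{N - 4(p+1)}{2(p+1)}\right)\left(\frac{{}_4R^2 - {}_2R^2}{1 - {}_4R^2}\right)$$

and rejecting H3 if F exceeds $F_\alpha(2(p+1),\, N - 4(p+1))$.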

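For Stage 1, accepting H2 as true, the additional restrictions $\alpha_{12} = \alpha_{34}$ and $b_{12} = b_{34}$ are imposed and the mvue, $\hat{\alpha}_{1234}$ and $\hat{b}_{1234}$, for the criterion cluster (1,2,3,4) formed from all four criteria are

$$\begin{bmatrix} \hat{\alpha}_{1234} \\ \hat{b}_{1234} \end{bmatrix} = \frac{n_1 + n_2}{N}\begin{bmatrix} \hat{\alpha}_{12} \\ \hat{b}_{12} \end{bmatrix} + \frac{n_3 + n_4}{N}\begin{bmatrix} \hat{\alpha}_{34} \\ \hat{b}_{34} \end{bmatrix}.$$

The standard weights, $\hat{\beta}_{1234}$, and the validities, $c_{1234}$, for the cluster (1,2,3,4) are

$$\hat{\beta}_{1234} = \frac{1}{N\sigma_{1234}}\left[(n_1 + n_2)\sigma_{12}\hat{\beta}_{12} + (n_3 + n_4)\sigma_{34}\hat{\beta}_{34}\right], \text{ and}$$

$$c_{1234} = \frac{1}{N\sigma_{1234}}\left[(n_1 + n_2)\sigma_{12} c_{12} + (n_3 + n_4)\sigma_{34} c_{34}\right], \text{ where}$$

$$N\sigma_{1234}^2 = (n_1 + n_2)(\sigma_{12}^2 + m_{12}^2) + (n_3 + n_4)(\sigma_{34}^2 + m_{34}^2) - N m_{1234}^2, \text{ and}$$

$$N m_{1234} = n_1 m_1 + n_2 m_2 + n_3 m_3 + n_4 m_4.$$

The model used to obtain the estimates for cluster (1,2,3,4) is

$$E\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \begin{bmatrix} u_1 & X_1 \\ u_2 & X_2 \\ u_3 & X_3 \\ u_4 & X_4 \end{bmatrix}\begin{bmatrix} \alpha_{1234} \\ b_{1234} \end{bmatrix} = \alpha_{1234}\begin{bmatrix} u_1 \\ u_2 \\ u_3 \\ u_4 \end{bmatrix} + \begin{bmatrix} X_1 \\ X_2 \\ X_3 \\ X_4 \end{bmatrix} b_{1234} \quad \text{with} \quad D\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix} = \sigma^2 I. \tag{15}$$

The squared multiple correlation coefficient, ${}_1R^2$, for (15) is

$${}_1R^2 = \hat{\beta}_{1234}' c_{1234}.$$

Equation (11a) can now be used to test at the $\alpha$ significance level the hypothesis

H4: $\alpha_{12} = \alpha_{34}$ and $b_{12} = b_{34}$, given $\alpha_1 = \alpha_2$, $\alpha_3 = \alpha_4$, $b_1 = b_2$, and $b_3 = b_4$

by computing

$$F = \left(\frac{N - 2(p+1)}{p+1}\right)\left(\frac{{}_2R^2 - {}_1R^2}{1 - {}_2R^2}\right)$$

and rejecting H4 if F exceeds $F_\alpha(p+1,\, N - 2(p+1))$. The hypothesis

H5: $\alpha_1 = \alpha_2 = \alpha_3 = \alpha_4$ and $b_1 = b_2 = b_3 = b_4$

can be tested at the $\alpha$ significance level by computing

$$F = \left(\frac{N - 4(p+1)}{3(p+1)}\right)\left(\frac{{}_4R^2 - {}_1R^2}{1 - {}_4R^2}\right)$$

and rejecting H5 if F exceeds $F_\alpha(3(p+1),\, N - 4(p+1))$.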
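The five tests H1 through H5 share one pattern: the model at the less restricted stage supplies the denominator and the error degrees of freedom, and each merged pair of clusters contributes p+1 numerator degrees of freedom. The sketch below expresses that pattern in a single function; the name `stage_f`, the sample size, and the stage-wise $R^2$ values are hypothetical and serve only to show the bookkeeping.

```python
def stage_f(N, p, r2_by_stage, s_from, s_to):
    """F ratio for moving from the s_from-cluster stage to the s_to-cluster stage.

    r2_by_stage maps the number of clusters s to the overall sR^2 at that stage.
    Returns (F, numerator df, denominator df).
    """
    df1 = (s_from - s_to) * (p + 1)
    df2 = N - s_from * (p + 1)
    r_from, r_to = r2_by_stage[s_from], r2_by_stage[s_to]
    F = (df2 / df1) * (r_from - r_to) / (1.0 - r_from)
    return F, df1, df2


# Hypothetical values for a four-criterion problem with p = 3 predictors.
N, p = 200, 3
r2 = {4: 0.60, 3: 0.58, 2: 0.55, 1: 0.40}

tests = {
    "H1": stage_f(N, p, r2, 4, 3),   # merge criteria 1 and 2
    "H2": stage_f(N, p, r2, 3, 2),   # merge criteria 3 and 4, given H1
    "H3": stage_f(N, p, r2, 4, 2),   # both merges tested jointly
    "H4": stage_f(N, p, r2, 2, 1),   # merge the two clusters, given H1 and H2
    "H5": stage_f(N, p, r2, 4, 1),   # all four criteria share one equation
}
```

Each F value would then be compared with the tabled point $F_\alpha(\text{df1}, \text{df2})$ returned alongside it.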
APPENDIX C: HIER-GRP SAMPLE OUTPUT


The  HIER-GRP  sample  output  provided  in  this  appendix  was  produced  by  using  the 
Univac  1108  runstream  which  precedes  it.  The  sample  analysis  involves  a grouping 
problem  with  nine  regression  equations  having  nine  predictors. 


APPENDIX D: HIER-GRP SOURCE LISTING